Statistical Analysis of Distance Estimators with Density Differences and Density Ratios

نویسندگان

  • Takafumi Kanamori
  • Masashi Sugiyama
چکیده

Estimating a discrepancy between two probability distributions from samples is an important task in statistics and machine learning. There are mainly two classes of discrepancy measures: distance measures based on the density difference, such as the Lp-distances, and divergence measures based on the density ratio, such as the φ-divergences. The intersection of these two classes is the L1-distance measure, and thus, it can be estimated either based on the density difference or the density ratio. In this paper, we first show that the Bregman scores, which are widely employed for the estimation of probability densities in statistical data analysis, allows us to estimate the density difference and the density ratio directly without separately estimating each probability distribution. We then theoretically elucidate the robustness of these estimators and present numerical experiments.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Density Estimators for Truncated Dependent Data

In some long term studies, a series of dependent and possibly truncated lifetime data may be observed. Suppose that the lifetimes have a common continuous distribution function F. A popular stochastic measure of the distance between the density function f of the lifetimes and its kernel estimate fn is the integrated square error (ISE). In this paper, we derive a central limit theorem for t...

متن کامل

Study on the effect of forest stand distribution pattern on results of different estimators of the nearest individual distance method

The Nearest Individual Sampling Method is one of the distance sampling methods for estimating density, canopy cover and height of forest stands. Some distance sampling methods have more than one density estimator that may be skewed to the spatial pattern. Unless the stands of the trees under study have a random spatial pattern. Therefore, the purpose of this study was evaluating the effect of s...

متن کامل

Analysis and Investigation of Landslide Hazard Zoning using Hybrid Model of Hierarchical Analysis and Surface Density

Identification of susceptible areas to landslide occurrence is one of the basic measures for reduction of the possible risk and hazard management. The main goal of this research is to compare the applicability of two statistical landslide hazard zonation models, valuing area accumulation and Analytical Hierarchy Process (AHP),in Ziarat Watershed, Gorgan, Golestan Province.In a review of previou...

متن کامل

Wavelet Based Estimation of the Derivatives of a Density for m-Dependent Random Variables

Here, we propose a method of estimation of the derivatives of probability density based wavelets methods for a sequence of m−dependent random variables with a common one-dimensional probability density function and obtain an upper bound on Lp-losses for the such estimators.

متن کامل

تأثیر الگوی پراکنش درختان بر برآورد تراکم با روش نمونه برداری نزدیک‌ترین فرد: مطالعات موردی در درختزارهای بنه زاگرس و توده‌های شبیه سازی شده

Distance methods and their estimators of density may have biased measurements unless the studied stand of trees has a random spatial pattern. This study aimed at assessing the effect of spatial arrangement of wild pistachio trees on the results of density estimation by using the nearest individual sampling method in Zagros woodlands, Iran, and applying a correction factor based on the spatial p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Entropy

دوره 16  شماره 

صفحات  -

تاریخ انتشار 2014